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© A method is disclosed for converting both a 
knowledge base and an inferencing technique into 
compilable program code forming a knowledge 
based system. The method is based on determining 
what actions an interpretitive inference engine would 
take with the specific knowledge base and generat- 
ing only the program code needed to perform these 
actions. This method eliminates the overhead of 
interpreting a representation of the knowledge base 
and significantly improves performance of the sys- 
tem. The method is applicable to both forward and 
backward chaining reasoning strategies and does not 
modify or restrict the functional capabilities of the 
knowledge based system. . 
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IF X = l AND Y<I0 THEN 0 
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METHOD FOR PRODUCING A KNOWLEDGE BASED SYSTEM 



The present invention relates generally to the 
field of knowledge based systems; that is, com- 
puter systems that emulate reasoning tasks by 
using a so-called "inference engine" to interpret 
encoded knowledge of human experts stored in a 
"knowledge base". If the domain of the knowledge 
base, or scope of the problem, is sufficiently nar- 
row and a sufficiently large body of the knowledge 
is properly coded in the knowledge base, then the 
knowledge based system can achieve performance 
that matches or exceeds the ability of a human 
expert. In such a case, the knowledge based sys- 
tem is called an "expert system." More specifi- 
cally, the present invention relates to a method of 
converting both a knowledge base and an 
"inference engine" into program source code 
which may be compiled into object code for execu- 
tion by a computer, thereby increasing the execu- 
tion speed of the expert system. 

While knowledge based system technologies 
have shown promise in many useful applications, 
their use in some areas has been impaired by the 
limited performance of the presently known im- 
plementations. This is particularly true in real-time 
monitoring and control applications which must rea- 
son large amounts of data in a very short period of 
time and in embedded applications in which the 
knowledge based system must share the resources 
of the computer with a number of other applica- 
tions. 

While the performance of current knowledge 
based systems may be improved through the de- 
velopment of special hardware to support the rea- 
soning process, a major performance limitation for 
knowledge based systems on conventional hard- 
ware is due to the inherent interpretive nature of 
their implementations. Even when an "inference 
engine" is implemented in a compiled language, 
the general purpose design the inference engine 
requires the interpretation of data structures repre- 
senting the knowledge base. This process of inter- 
pretation is inefficient in comparison to compiled 
implementations. Compilation of the knowledge 
bases can be performed by restricting the inferen- 
cing techniques available to decision trees or other 
single paradigm reasoning strategies; however, this 
restriction limits the knowledge engineer's ability to 
represent and solve some classes of problems. 

Most knowledge based systems represent 
knowledge internally as collections of data struc- 
tures. The data structures are composed of nodes 
and links between the nodes. The nodes represent 
the rules and the tests within the rules. Included in 
the nodes are: how facts are tested and the results 
of these tests, how to acquire the facts, the oper- 



ations performed on the facts and the means of 
processing the results of the tests. The links repre- 
sent the way in which facts and the results of tests 
are combined into rule conditions, the references 

5 made to these tests and rule conditions by other 
rules in the knowledge base, and the actions to be 
performed when a rule is "true". The nodes are 
usually organized into trees or networks which are 
augmented with lists of references or current data 

70 values. These networks are traversed by the infer- 
ence engine to implement the reasoning process. 
At each node, the inference engine must examine 
the contents of the node to determine what data is 
involved and what tests need to be performed on 

is that data (test evaluation), which nodes to process 
next based on the results of these tests (test result 
propagation), and what actions to perform when a 
rule becomes true (performance of rule actions). 
Based on the results of this process the inference 

20 engine must then follow one or more links to the 
other nodes. This is the essence of the interpretive 
process. 

As an example of this, consider a simple back- 
ward chaining system which organizes its rules as 

25 a set of goal trees, each containing one or more 
rules that provide conditions and fact values con- 
cluding in a single goal. A particular set of such 
rules, and a goal tree that would result from them, 
is represented in Figure 1. The reasoning process 

30 for such a system would begin by selecting a goal 
and then "depth-first" traversing the respective 
goal tree to locate a fact that is not currently 
known. The value for the fact is then acquired by 
interpreting information in the fact node that in- 

35 dicates how to obtain the value. Next, test evalu- 
ation occurs by interpreting information in the test 
nodes that reference that fact. The results of these 
tests are then propagated up the goal tree by 
reversing the traversal process and interpreting 

40 each node to determine how the results of previous 
nodes are to be combined and passed to the next 
node in the tree. In addition to the interpretation 
that must be performed for test evaluation and 
result propagation, the tree traversal process must 

45 also be interpretive because connections are only 
known to the nodes which represent the trees. This 
form of backward chaining reasoning mechanism is 
similar to that of EMYCIN (Van Melle et al, "The 
EMYCIN Manual", Heuristic Programming Project 

so Report HPP-81-16 , Stanford University, 1981); PIE" 
(Burns et al., "The Portable Inference Engine: Fit- 
ting Significant Expertise into Small Systems", IBM 
Systems Journal , Vol. 25. No. 2, 1986), and exam- 
ple systems proposed by Winston and Horn ( LISP , 
2nd Ed., Chapter 18, "Expert Problem Solving us- 
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ing Rules and Streams," Addison Wesley. Reading. 
Mass.. 1984). 

Other known knowledge based systems are 
disclosed in the U.S. Patents Nos. 4,591,983; 
4.648.044 and 4,713,775. 

Forward chaining reasoning algorithms differ 
from backward chaining in that they work forward 
from facts to goals. The tree or network form of 
knowledge representation may still be used but the 
depth-first traversal to find a fact to evaluate is not 
performed. Instead, the system must search for 
facts that match rule conditions. Typically, this in- 
volves propagation of data values to the rules that 
use that data and evaluation of rule tests based on 
that data. This again involves interpretation of the 
test nodes to evaluate these tests with respect to a 
set of facts. After a test is evaluated, the results are 
propagated up the goal trees by interpreting the 
nodes at each step to determine how the results of 
previous nodes are to be processed, combined and 
passed to the next node in the network. This form 
of forward chaining reasoning mechanism is similar 
to that of example systems proposed by Winston 
and Horn ( LISP . 2nd Ed., Chapter 18, Addison 
Wesley, Reading. Mass., 1984) and has been high- 
ly optimized in OPS5 (Forgy, "Rete: A Fast Al- 
gorithm for the Many Pattern/Many Object Match 
Problem", Artificial Intelligence , Vol. 19, 1982). 

The interpretive traversal of these trees or net- 
works and the determination of the steps required 
to process each node degrades the performance of 
the reasoning process. At each node, the reasoning 
system must determine which of a possible set of 
steps are required and then perform those steps 
using information available at the node. Once the 
steps required to process a node have been per- 
formed, a determination of which- node(s) to pro- 
cess next must be made. The performance of 
knowledge based systems could be greatly im- 
proved if the interpretation process were eliminated 
and the equivalent steps were performed by con- 
cise, compiled program code. 

It is a principal objective of the present inven- 
tion to provide a technique or method that can be 
used to convert knowledge bases directly into com- 
pilable, high level language, program code provid- 
ing high performance using existing and proven 
reasoning algorithms. 

It is a further objective of the present invention 
to provide a method for the represention of knowl- 
edge bases in a program form and the generation 
of this form from the rules of a knowledge base. 

These objects, as well as other objects which 
will become apparent from the discussion that fol- 
lows, are achieved, according to the present inven- 
tion, by a method comprising the following steps: 

(a) Partitioning the network of/ nodes, and 
links between such nodes, into a plurality of rule 



trees or "sub-networks". 

(b) Labelling each node within each respec- 
tive sub-network with a unique identifier or index 
for subsequent reference. 
5 (c) Generating a segment of compilable, pro- 

cedural, program code which is equivalent to each 
respective sub-network when combined with the 
inferencing technique. This program code will here 
inafter be called the "inferencing code". The iden- 
w tifiers which were established in step (b) are used 
to locate the respective code for each node within 
each respective sub-network. This inferencing code 
comprises a node subsegment that implements the 
function of each node in each respective sub- 
T5 network based on the rules specified in the knowl- 
edge base, and it conditionally invokes other sub- 
segments using the identifiers. The inferencing 
code also comprises a control sub-segment which 
provides access to each node sub-segment using 
20 the identifiers and provides for repeated execution 
of each node sub-segment as necessary in order 
to effect the traversal of the network. 

(d) Generating for each data item a segment 
of compilable, procedural, program code for the 
25 distribution of such data item. This code, which will 
be hereinafter called the "distribution code", in- 
vokes the appropriate inferencing code by means 
of the identifiers when such data item is modified 
during execution of the knowledge based system. 

Thus the basic approach to rule compilation, 
according to the present invention, is first to deter- 
mine the steps that would be taken by an interpre- 
tive inference engine during the reasoning process 
then and to generate only the program code need- 
35 ed for these steps. Through analysis of the knowl- 
edge base, the set of possible steps for any given 
component of the knowledge base can be signifi- 
cantly reduced, often to a single step. This can 
then be converted to concise, efficient, procedural, 
40 program code to implement the reasoning process. 
Similarly, the steps that must follow this processing 
can also be identified so that additional program 
code can be generated. Using this approach, any 
interpretive inferencing program can be converted 
45 to a compiled system which avoids the interpretive 
steps. 

As may be seen, the method according to the 
present invention thus addresses the compilation of 
the test evaluation, the test result propagation, and 

so the performance of rule actions. These activities 
are common to forward chaining, backward chain- 
ing, as well as to mixed chaining (combined for- 
ward and backward chaining) reasoning processes. 
Since the evaluation and propagation steps are the 

55 - most frequently used operations in reasoning, their 
compilation will produce the most benefit in overall 
performance. Methods to improve the performance 
of the selection of facts to evaluate for backward 
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chaining reasoning are not specifically addressed 
hereinbelow, as they may be extrapolated from the 
forward reasoning process presented here. 

In order to perform rule compilation, the rules 
must be converted into a network representation 
with nodes for each rule component and links in- 
dicating the relationships of rule components to 
other rule components and facts. This is the same 
network that is used by the interpretive inference 
engine during execution of the knowledge base as 
in Figure 1. In addition, a mechanism must be 
developed to compile the traversal of the network 
for the propagation of results in order to preserve 
the dynamic nature of the knowledge based sys- 
tem. 

The compilation technique according to the 
invention first partitions the network of rules into a 
set of rule tree sub-networks. Although the sub- 
networks are considered to be trees, they need not 
be proper trees because they may contain referen- 
ces to other sub-networks within the tree. The 
criteria used for this partitioning is arbitrary. The 
partitioning can be performed on a rule basis or by 
groups of rules that use a common set of data. To 
maximize performance, the criteria should be 
based on minimizing references to other sub-net- 
works outside of each respective individual sub- 
network in order to reduce procedure call over- 
head. 

Next, the nodes of each rule tree (sub-network) 
are represented as parts of a case structure within 
a "while loop" controlled by a case index. Each 
part of the case structure consists of a segment of 
program code that performs the function of a node 
in the rule network generated from the knowledge 
base. Each of the program code segments con- 
ditionally passes control to its parent node in the 
tree through manipulation of the case index. Code 
segments may also pass other information, such as 
fact bindings or certainty factors, to their parents. 
The use of a tree structure guarantees that only 
one parent exists for each node in the rule tree, 
eliminating the need for multiway branching and a 
stack mechanism to maintain the branch informa- 
tion. Parents outside of the rule tree can be acces- 
sed by procedural invocation of the node in the 
outside rule tree. This eliminates the direct need 
for a stack to hold pending values as it is implicit in 
the procedural invocation mechanism. A stack may 
be required in situations where multiple data values 
must be passed to the parents individually. This 
need may also be eliminated by passing the data 
values as a list of values rather than as separate 
items. The use of the "while loop" permits propa- 
gation (execution of a series of code segments) to 
continue as far as possible on each invocation of 
the control structure. It also allows any node in the 
rule tree to be invoked on demand for additional 



flexibility in node evaluation (for example, when 
one part of an element in a structure of fact 
changes). 

The final stage of the control structure iden- 

5 tifies a rule as "concluded". This can consist of 
placing an entry in a conflict set to be selected for 
activation by a conflict resolution mechanism or by 
direct execution of the rule's action. 

Finally, procedural program code must be gen- 

70 erated to activate the rule trees based on changes 
to data items in the knowledge base. This program 
code is termed "distribution code" herein because 
its purpose is to "distribute" the modifications to 
the respective data items during execution of the 

is knowledge based system. Through analysis of the 
knowledge base, a list of rule trees and the nodes 
within those trees can be developed for each data 
item. From these lists, the distribution code can be 
generated to invoke the appropriate node of each 

20 rule tree when the data item is changed. The 
distribution code can be structured to invoke all 
nodes referring to a data item or only selected 
nodes. 

In some systems the identification of a data 

25 item to distribute is performed as part of a test on 
a generic data item or class that is implemented as 
an initial (implicit) test in the rule trees, (see Forgy, 
"Rete: A Fast Algorithm for the Many Pattern/many 
Object Match Problem", Artificial Intelligence , Vol. 

30 19, 1982) Since the exact use of each data item by 
rule tree can be predetermined through analysis of 
references in the rules, it is more efficient to di- 
rectly invoke rule tree structures, thus eliminating 
the need for this test. 

35 Using this implementation, the process of eval- 
uation and propagation is started by the distribution 
code when a data item is updated. The distribution 
code passes an index value for the node to be 
evaluated to the rule tree case structure. This 

40 causes selection and evaluation of the desired 
node. Depending upon the results of the evaluation, 
the index is reset to either the index value of the 
parent node to be evaluated or to the exit index. 
The process continues until the rule is concluded 

46 or all sub-conclusions that can be derived from the 
currently available information are exhausted. Re- 
ferences to a rule from other rule trees are imple- 
mented as invocations of the other rule tree case 
structures that pass the appropriate index for the 

so referenced entry. 



BRIEF DESCRIPTION OF THE DRAWINGS 

55 

Figure 1 (comprised of Fig. 1a and Fig. 1b) 
illustrates how rules of a knowledge base (Fig. 1a) 
can be represented by a "rule tree" network (Fig. 
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Figure 2 (comprised of Fig. 2a, Fig. 2b and Fig. 
2c) illustrates how rules of a knowledge base (Fig. 
2a) can be translated into a rule tree network (Fig. 
2b), and how this rule tree network can be con- 
verted into compilable, procedural, program code 
according to the present invention (Rg. 2c). 

A preferred embodiment of the present inven- 
tion will now be described with reference to Fig. 2 
of the drawings which illustrates a simplified exam- 
pie of the inventive method. 

Fig. 2a shows a rule that forms a simple knowl- 
edge base. Fig. 2b shows the internally generated 
rule tree resulting from this rule. Fig. 2c shows a 
simplified version of the program code resulting 
from this knowledge base to perform the distribu- 
tion and propagation functions. Whenever the val- 
ues of A or B are changed, the A-DIST: or B-DIST: 
procedures invoke the rule tree procedure RULE1 
(i.e. "R1") with the appropriate index value to 
cause evaluation of the tests. The results of the 
tests are then propagated up the tree as far as 
possible. Details of the code to evaluate the AND 
condition are omitted from this example. This func- 
tion must determine whether all of the required 
terms are evaluated. The result is used to either 
pass control to the next node-segment or set the 
index to 0 to terminate processing until all terms 
have been evaluated. There are many possible 
ways to perform this function such as direct testing 
of status indicators, searching a list of indicators, or 
using evaluation count mechanisms. Program code 
to manage and pass data values to parent nodes 
has also been omitted from this example for sim- 
plicity. 

The conversion of the rules in the knowledge 
base into compilable, procedural program code ac- 
cording to the present invention is therefore ac- 
complished by the following steps: 



Step 0: 

Put the knowledge base in a network form as is 
used for interpretive inferencing. This step is per- 
formed in most current knowledge based system 
implementations. The network should represent 
rule components (tests, logical operators, actions, 
facts items) as nodes and the relationship of the 
rule components to each other and to facts in the 
knowledge base as links. 



Step 1: 

Convert the knowledge base into a forest of 
rule trees. This requires the application of some 
criteria for partitioning the rule network into rule 



tree sub-networks which have a single root. A mini- 
mum number of references out of each sub-net- 
work is desirable for performance reasons. The 
most convenient criteria for the partitioning in most 
s systems is that each rule or end of an inference 
chain should form a rule tree with the rule's action 
becoming the root. 

io Step 2: 

Depth-first traverse each resulting rule tree 
(sub-network) starting at the root and label each 
node sequentially with an identifier or index starting 
75 with "1". This step will uniquely identify each node 
within a rule tree and provide the index value for 
the case structure. Nodes of rules that are not in 
the current rule tree are not numbered. 

20 

Step 3: 

Depth-first traverse each rule tree and generate 
a procedure for the rule consisting of a prologue 

25 (procedure header, initialization, while and case 
statements), a case part for each node that imple- 
ments the function of that node based on the rules 
specified in the knowledge base, conditional code 
to reset the index to the next node to be evaluated 

30 under the proper conditions, and an epilogue. Case 
index values are obtained from the numbering of 
Step 2. Nodes that are referenced by other rule 
trees should contain calls to the referencing rule 
procedures passing the index value for the refer- 

35 ence point and any associated fact bindings or 
certainties. The index value is obtained by using 
the link to the reference point. Rule actions that 
update fact values should also propagate those 
values to referencing rules through calls to the 

40 appropriate distribution procedures. 



Step 4: 

45 For each data item in the knowledge base, 

generate a distribution procedure consisting of a 
prologue, a code to call each rule procedure which 
references the data item, and an epilogue. The call 
will pass the index for the test node to be executed 

so in the rule procedure. 

This method according to the present invention 
generates a set of rule processing and data dis- 
tribution procedures that are specific to the knowl- 
edge base. The method combines the specific 

55 rules of the knowledge base with the inferencing 
logic to produce a unique code module. Hence, the 
inference engine is no longer physically separated 
from the rules as is the case with prior known 
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knowledge based systems; instead it has been 
optimally merged with the rule code. Despite the 
lack of a physically distinct inference engine, the 
functionality of the conventional approach (separate 
inference engine and knowledge base) has been 
preserved since the resulting modules take the 
same actions that the inference engine would have 
taken. 

The processing performance gains obtained by 
the compilation technique according to the present 
invention depend heavily on the complexity of rule 
language, the amount of interpretation done at ex- 
ecution time by the inference engine, and the un- 
derlying algorithms used. The performance gains 
occur at the points at which selections must be 
made between alternative paths or processing 
steps. Knowledge representation languages which 
offer a richer set of features will encounter more 
alternative paths during processing and can be 
expected to experience greater performance gains. 

The implementation of rules and the inference 
engine as procedural, program code provides addi- 
tional efficiencies of implementation beyond the 
removal of the interpretive overhead. Facts and 
knowledge base control blocks can be represented 
as program variables and referenced directly rather 
than using indirection, linked lists or other dynamic 
structures that require search or more complicated 
traversal. Similarly, expressions used in rule con- 
dition tests and action statements can be imple- 
mented efficiently in the code rather than as com- 
plex interpretive structures. Interpretive inference 
engines cannot take advantage of these implemen- 
tations because they must be generalized for all 
possible knowledge bases and structures. 

Additionally, the conversion of knowledge 
bases into program code improves the embed- 
dability of the resulting system. The compiled 
knowledge base is compatible with any environ- 
ment that the language that is generated is com- 
patible with. 

The program code generated by the compila- 
tion technique according to the present invention 
will potentially require more storage than the cor- 
responding data structures used by interpretive 
techniques. This is because the generalized logic 
used by the interpretive approach is customized 
and repeated in each node. This type of tradeoff 
(more storage for lower processing requirements) 
is typical of processing optimization techniques. 
The actual difference in storage size will again 
depend on the complexity and features of the rule 
language. The amount of storage required will gen- 
erally be directly proportional to the amount of 
storage required by the data structure form of the 
knowledge base; combinatorial explosions resulting 
from rule and data interactions will not occur. This 
is because the code is generated from an internal 



representation of the knowledge base that is similar 
to that used by an interpretive form of the same 
knowledge base. 

The size of the generated code can be re- 

5 duced through the proper use of utility procedures. 
The compilation of the knowledge base may still 
result in some commonly used sequences of code. 
Any code sequence that does not incorporate code 
derived from knowledge base specifications can be 

io considered for implementation as a utility proce- 
dure. 

It should also be noted that the program code 
generated using the compilation technique accord- 
ing to the present invention includes the logic for 

;s the inference engine whereas the comparable data 
structures used by the conventional approach do 
not. To obtain truly comparable numbers, the size 
of the inference engine software in conventional 
techniques must be considered along with the size 

20 of the data structures representing the knowledge 
base. 

An alternative approach to knowledge base 
compilation is to represent the rule network as a 
set of decision trees which can then be repre- 
ss sented by a series of nested IF statements. This 
approach has been used in a number of systems 
such as compiled versions of EMYCIN (Van Melle 
et aL, "The EMYCIN Manual"; Heuristic Program- 
ming Project Report HPP-81-16 , 1981). However, 
30 this approach effectively fixes the order of evalu- 
ation of the parameters involved (based on their 
values) and does not provide access to individual 
nodes on demand. Since access to individual 
nodes is not provided, this approach does not 
35 support techniques such as dynamic rule ordering 
based on confidence or forward chaining reason- 
ing. In addition, this approach is specific to the 
inferencing provided by these systems and may 
not be applicable to other inferencing paradigms. 
40 Another approach is to compile the conditions 

and actions of rules into program code but retain 
the inference network as an interpreted data struc- 
ture. This technique has been used by Knowled- 
geTool (a trademark of IBM Corp.) and ETC 
45 (Drastal et al, "Economy in Expert System Con- 
struction: The AEGIS Combat System Maintenance 
Advisor", SPIE Applications of Artificial Intelligence 
Conference , 1986). This approach only improves 
the performance of condition evaluation and action 
so execution and does not address the issue of rule 
result distribution which is central to the inferencing 
process. 

The compilation technique according to the 
present invention has certain similarities to an ap- 
55 proach to the compilation of RETE networks sug- 
gested by Forgy, "Rete: A Fast Algorithm for the 
Many Pattern/Many Object Match. Problem," Artifi- 
cial Intelligence, Vol. 19, 1982. However, Forgy's 
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approach is specific to RETE networks and in- 
volves the use of branching (GOTOs) and a stack 
to retain alternatives. The use of branching and the 
lack of a distribution procedure could require large 
code segments to implement the entire RETE net- 
work which could exceed the limit imposed by 
language compilers. The technique according to 
the present invention produces structured code 
without GOTOs and does not require an explicit 
stack. The production of structured code allows 
rules to be implemented in smaller modules to 
avoid the restrictions of high level language compil- 
ers and to provide possibilities for segmentation of 
knowledge bases in storage for operations on small, 
systems. 



Conclusion : 

A method for the compilation of knowledge 
bases has been presented. This method provides a 
performance improvement over the conventional 
interpretive approach that varies depending on the 
complexity and features of the rule language. This 
method is superior to the decision tree generation 
approach because it allows direct access to each 
node in the resulting rule networks supporting for- 
ward chaining reasoning and allows the use of 
techniques such as dynamic rule ordering. The 
method also goes beyond approaches which com- 
pile only the condition tests and action statements 
of rules because it also compiles and optimizes the 
inferencing mechanism. 



Claims 

1. A method of converting both a knowledge 
base and an inferencing technique into compilable 
program code forming a knowledge based system, 
said knowledge base including rules and data 
items, said rules being arranged in a network of 
nodes and links between such nodes, said nodes 
representing tests, logical operators, actions and 
data items, said network being in a form convenient 
for interpretive inferencing, said method comprising 
the steps of: 

(a) partitioning said network into a plurality of 
sub-networks; 

(b) labelling each node within each respec- 
tive sub-network with a unique identifier; 

(c) generating a segment of compilable, pro- 
cedural, program code for inferencing which is 
equivalent to each respective sub-network when 
combined with said inferencing technique, said 
identifiers locating the respective code for each 
node within each respective subnetwork, said in- 
ferencing code comprising a node sub-segment 



that implements the function of each node in said 
respective sub-network based on the rules speci- 
fied in said knowledge base and that conditionally 
invokes other sub-segments using said identifiers, 

5 and a control sub-segment which provides access 
to each node sub-segment using said identifiers 
and provides for repeated execution of each node 
sub-segment as necessary; and 

(d) generating for each data item a segment 

w of compilable procedural, program code for the 
distribution of such data item which invokes the 
appropriate program inferencing code by means of 
said identifiers when such data item is modified 
during execution of the knowledge based system. 

is 2. The method defined in claim 1 , further com- 
prising the initial step of: 
forming said network from said knowledge base. 

3. The method defined in claim 1 , wherein step 
(a) includes the step of selecting rule trees in said 

20 network as the respective sub-networks, with each 
rule tree's action, which is the root of the rule tree, 
becoming the start of the respective sub-network. 

4. The method defined in claim 1 . wherein step 

(a) includes the step of selecting nodes in said 
25 network as the respective sub-networks such that 

the references to nodes outside said sub-networks 
are minimized. 

5. The method defined in claim 1 , wherein step 

(b) includes the step of depth-first traversing each 
30 sub-network, starting at the end of such sub-net- ; 

work, and labelling each node in such sub-network 
sequentially. 

6. The method defined in claim 5, wherein said 
nodes in each sub-network are labeled sequentially 

35 in consecutive order. 

7. The method defined in claim 6. wherein said 
nodes of each sub-network are labeled sequentially 
in consecutive order starting with the number "1". 

8. The method defined in claim 1 , wherein step 
40 (c) includes the step of depth-first traversing each 

sub-network and generating a segment of com- 
pilable, procedural, program code for inferencing 
which is equivalent to such sub-network, .said in- 
ferencing code comprising (1) a prologue consist- 

45 ing of initialization, while and case statements, (2) a 
case part for each node that implements the func- 
tion of that node based on the rules specified in the 
knowledge base and conditional statements to re- 
set the identifier to the next node to be evaluated, 

so and (3) an epilogue comprising said control sub- 
segment. 

9. The method defined in claim 8, wherein 
node sub-segments that are referenced by the 
program code for other sub-networks contain calls 

55 to the referencing rule procedures that pass the 
identifier for the reference point and any data asso- 
ciated therewith. 

10. The method defined in claim 8, wherein 



13 EP 0 355 287 A2 14 



node sub-segments contain calls to utility software 
to implement functions whose logic is not derived 
from said knowledge base. 

11. The method defined in claim 1. wherein 
step (c) includes the step of generating a segment 5 
of inferencing code which includes calls to said 
distribution code for propagation of updated data 
items. 

12. The method defined in claim 1, wherein 

step (d) includes the step of generating said dis- w 
tribution code consisting of a prologue sub-seg- 
ment, a subsegment to call each rule procedure 
which references the data item, and an epilogue 
sub-segment. 

13. The method defined in claim 12, wherein ;s 
said call sub-segment passes the identifier for the 

test node to be executed by the inferencing code. 
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FIG.I 

EXAMPLE RULES AND THEIR INTERNAL REPRESENTATION: 



FIG.I a 

RULES 

IF Q OR Z>I22 THEN ACTION I 
IF X = l AND Y<IO THEN Q 



FIG. lb 

INTERNAL RULE TREE 



goal F— Taction i 1 



FIG.2 

THE CONVERSION OF RULES INTO CODE: 




FIG.2a 

KNOWLEDGE BASE 
IF A> 10 AND B <0 
THEN DISPLAY A 



FIG.2b 

RULE TREE 

| RULE | 

ACTION I 



/ 


DISPLAY A | 


AND 


- 2 



AND - 2 
l 10 B 0 



NOTE: N = CASE /NODE INDEX 



FIG.2C 

GENERATED CODE 

Rl: PROC (N) 

WHILE (N<>0) 
CASE (N) 

I: DISPLAY A 
Z- IF LEFT INPUT TRUE 
AND RIGHT INPUT TRUE 

THEN N=l 

ELSE 

N=0 

3= IF A>IO THEN 
N=2 
ELSE 
N=0 

4: IF B<0 THEN 
N=2 
ELSE 
N=0 
END 

END 

A- DIST= PROC 

CALL Rl (3) 

II II 

END 
B- DISTs PROC 

CALL Rl(4) 

it 

END 
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© A method is disclosed for converting both a 
knowledge base and an inferencing technique into 
compilable program code forming a knowledge 
based system. The method is based on determining 
what actions an interpretitive inference engine would 
take with the specific knowledge base and generat- 
ing only the program code needed to perform these 

FIG.2 

THE CONVERSION OF RULES INTO CODE: 



FIG.2a 



KNOWLEDGE BASE 
IF A> IO AND B <0 
THEN DISPLAY A 



FIG.2b 




NOTE 1 N'CASE /NODE IN0EX 



actions. This method eliminates the overhead of 
interpreting a representation of the knowledge base 
and significantly improves performance of the sys- 
tem. The method is applicable to both forward and 
backward chaining reasoning strategies and does not 
modify or restrict the functional capabilities of the 
knowledge based system. 



FIG.2c 

GENERATED CODE 

Rl« PROC (N) 

WHILE (N<>0) 
CASE <N) 

I- DISPLAY A 
2* IF LEFT INPUT TRUE 
AND RIGHT INPUT TRUE 

THEN N»l 

ELSE 

N«0 

3* IF A>IO THEN 
N»2 
ELSE 
N-0 

4«IF B<0 THEN 
N«2 
ELSE 
N»0 
ENO 

ENO 

A- 0IST: PROC 

CALL Rl (3) 

ENO 

e- oist: proc 

CALL Rl(4) 
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